Method and System to Combine Keyword Results and Natural Language Search Results

ABSTRACT

A search query is received from a single input field of a user interface. A keyword search is performed based on the search query to generate keyword search results. A natural language search is performed of a frequently-asked question (FAQ) database based on the search query to generate FAQ search results. The keyword search results and the FAQ search results are combined in a display page.

CLAIM OF PRIORITY

This application is a continuation application of, and claims priority to, U.S. patent application Ser. No. 10/975,023, filed Oct. 27, 2004, the contents of which are expressly incorporated herein by reference in their entirety.

BACKGROUND

A challenge for a developer of a Web site is predicting what users will want to do on the Web site and whether all users will understand the names of the links provided on the Web site. Since some users prefer to navigate a Web site by typing what they are looking for, many Web sites provide a search engine to allow users to find what they are looking for using their own terminology. Allowing users to find what they are looking for quickly and easily often translates into increased sales and/or reduced support costs.

The development of natural language search algorithms has made use of natural language searching of Web sites more feasible and popular. A natural language search engine receives full sentences (which may be in English or another language) as typed or otherwise entered by users, and searches a set of data based on both the words in the sentence and the construction of the sentence. Natural language search engines are often used to retrieve answers to frequently asked questions (FAQs) by searching a FAQ database including a number of questions and answers.

Keyword search engines are also provided by many Web sites. A keyword search engine searches an entire Web site for pages and other resources related to user-entered keywords.

When both a natural-language FAQ search feature and a keyword search feature are provided by a Web site, the user may become confused as to which search engine to use. Further, the placement and naming of the two search features can become problematic for a developer of the Web site.

Further, search results are generally present in a linear list with multiple types of results being intermixed. Some of the search results may have little bearing on the user's intended search. For example, a user searching on “DSL” may be presented search results related to sales of DSL, technical aspects of DSL, technical support for DSL and other DSL-related results, all mixed together without preference given to the user's area of focus. Recent research has pointed to the use of categorization of search results as a method to allow users to more quickly and easily find the results they are looking for.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is pointed out with particularity in the appended claims. However, other features are described in the following detailed description in conjunction with the accompanying drawings in which:

FIG. 1 is a flow chart of an embodiment of a method of providing combined search results;

FIG. 2 is a block diagram of an embodiment of a system for providing combined search results;

FIG. 3 is a screen shot of an embodiment of the user interface in FIG. 2;

FIG. 4 is a screen shot of an embodiment of the first display page in FIG. 2;

FIG. 5 is a screen shot of an embodiment of the second display page in FIG. 2;

FIG. 6 is a screen shot of an embodiment of the first display page in FIG. 2 that includes keyword search results but not FAQ search results; and

FIG. 7 is a screen shot of an example of a detailed search results page that includes a drop-down menu for modifying a category.

DETAILED DESCRIPTION

Tests have shown that about half of sampled users prefer keyword searches, and about another half prefer natural language searches in some applications. To satisfy all users, embodiments of the present disclosure perform both types of searches based on a search query entered into a single input field, and combine the two resulting types of search results into a single display page.

To assist users in focusing on particular areas of interest in the search results, the display page categorizes the keyword search results and the FAQ search results into a plurality of categories. The types and number of categories are determined using a card sort technique. In the above-described DSL example, the display page may categorize the DSL search results into a products and services category, a billing and account category, a help and support category, and a corporate information category. The user can view and directly expand on the help and support category, for example, from the display page to retrieve the information he/she is looking for.

Embodiments of the present disclosure are described with reference to FIG. 1, which is a flow chart of an embodiment of a method of providing combined search results, and FIG. 2, which is a block diagram of an embodiment of a system to provide combined search results.

As indicated by block 10, the method comprises receiving a search query 12 from a single input field 14 of a user interface 16. The user interface 16 may be provided by a search service that provides search results for multiple Web sites or by a page of a Web site that provides search results for other pages of the Web site. The user interface 16 may accompany the single input field with a textual display to clearly indicate to users that they may enter search queries that comprise either keyword(s) or a full question. In one embodiment, the textual display accompanying the single input field comprises “search or ask a question”.

As indicated by block 20, the method comprises performing a keyword search based on the search query 12 to generate keyword search results 22. The keyword search is performed by a keyword search engine 24. The keyword search engine 24 performs the keyword search of contents of one or more Web sites 26. If the user interface 16 is provided by a general purpose search service, the contents of one or more Web sites 26 comprise contents from multiple Web sites that have been indexed by the search service. If the user interface 16 is provided by a Web page to enable users to search within a single Web site, the contents of one or more Web sites 26 may comprise content only from multiple pages of the same Web site.

As indicated by block 30, the method comprises performing a natural language search based on the search query 12 to generate search results 32. The natural language search is performed by a natural language search engine 34. The natural language search engine 34 performs the natural language search of contents of a frequently-asked questions (FAQ) database 36. If the user interface 16 is provided by a Web page to enable users to search within a single Web site, the FAQ database 36 may comprise questions and associated answers that are frequently asked or anticipated to be asked by users of the Web site.

As indicated by block 40, the method comprises combining the keyword search results 22 and the FAQ search results 32 in a display page 44.

In one embodiment, the display page 44 is divided into two columns: a first column to contain some or all of the keyword search results 22 but none of the FAQ search results 32, and a second column to contain some or all of the FAQ search results 32 but none of the keyword search results 22.

Optionally, the display page 44 categorizes the keyword search results 22 and the FAQ search results into a plurality of categories. For purposes of illustration and example, the display page 44 is shown to have three different categories A, B and C, although any number of categories may be used. In this example, the keyword search results 22 are divided into keyword search results 50 in category A, keyword search results 52 in category B, and keyword search results 54 in category C. Similarly, the FAQ search results 32 are divided into FAQ search results 60 in category A, FAQ search results 62 in category B, and FAQ search results 64 in category C.

The particular categories A, B and C are application-dependent. In some applications, the categories may include a products and services category, a billing and account category, a help and support category, and a corporate information category. These categories may be of particular interest when the user interface 16 is part of a telecommunication service provider's Web site.

The categories may be determined using a card sort method prior to receiving the search query 12. The card sort method comprises creating a representative sample of some or all of the Web site content on cue cards. The cue cards may be either physical cards or virtual cards displayed using a computer. A first sample of individuals organize the cue cards into groups and name the groups in a manner that makes sense to them. The individuals may perform these acts either individually or collectively. If performed individually, a statistical cluster analysis is performed to determine an optimal set of groupings that accommodates as many of the individuals' groupings as possible. The naming of the groups in the optimal set are based on the names provided by the first sample of individuals. The naming and nature of the groupings are validated using a reverse card sort. The reverse card sort comprises having a second sample of individuals, different from the individuals in the first sample, sort the cue cards into the defined groups. Unlike existing search engines that use company created categories, the card sort method generates search categories that match users' mental models.

In general, the layout of the search results on the display page may be determined by performing one or more usability tests prior to receiving the search query 12. Based on the results of a usability test for a layout, a modified layout may be determined and subjected to a subsequent usability test. This process may be repeated until a desirable layout is achieved.

In one embodiment of the layout, the display page 44 is divided into rows, where each row is dedicated to display keyword search results and FAQ search results for an associated one of the categories. For example, the keyword search results 50 in category A and the FAQ search results 60 in category A are displayed in a first row dedicated to category A. The keyword search results 52 in category B and the FAQ search results 62 in category B are displayed in a second row dedicated to category B. The keyword search results 54 in category C and the FAQ search results 64 in category C are displayed in a third row dedicated to category C.

The display page 44 may display all of the keyword search results and/or FAQ search results in a particular category, or may display only a subset of the keyword search results and/or FAQ search results in a particular category. In one embodiment, at most three keyword search results and at most three FAQ search results are displayed in each particular category. In this embodiment, the top one, two or three keyword search results and the top one, two or three FAQ search results (based on the search query 12) may be displayed in each particular category.

Presenting the top three results for each category directly on the first display page 44 promotes user understanding of the nature of each category based on both the displayed name of the category and the search results in the category. Further, the user is allowed to find links to the particular result(s) of interest to him/her directly from the first display page 44. These features distinguish from existing search engines that show only company-created category names on the front page that require users to guess the meaning of each category and to go to a subsequent page to see the results of a category.

The order of the categories in the display page 44 may be either fixed or variable. Optionally, the order of the categories varies based on how well the results in each category match the search query 12. In one embodiment, the category whose results best match the search query 12 is displayed in the first row, followed in the second row by the category whose results have the second best match with the search query 12, and so on. For example, if the search query 12 is “slow DSL”, the “help and support” category may be presented first in a list of categories because the best matches for “slow DSL” are within the “help and support” category. When viewed from top to bottom, the display page 44 displays categories with decreasingly better matches in this embodiment.

Optionally, the display page 44 also includes the user interface 16 so that the user can refine his/her search or enter a different search query.

As indicated by block 66, the method optionally comprises including in the display page 44 a user-selectable command to expand a number of keyword search results and/or FAQ search results that are displayed together in a category. In the example of FIG. 2, the display page 44 may further comprise user-selectable commands 70, 72 and 74 to expand a number of keyword search results and FAQ search results that are displayed together in categories A, B or C, respectively.

As indicated by block 76, the method comprises outputting a signal to display the display page 44 to the user who entered the search query 12. The display page 44 may be encoded in a markup language such as Hypertext Markup Language (HTML) or an alternative type of computer code.

As indicated by block 80, the method optionally comprises outputting another display page 82 that expands a number of keyword search results and FAQ search results that are displayed together in a category in response to a user selection of one of the commands 70, 72 or 74. For purposes of illustration and example, consider the user selecting the command 74. In response thereto, expanded keyword search results 84 in category C and expanded FAQ search results 86 in category C are displayed in the display page 82. In one embodiment, the display page 82 includes ten keyword search results and ten FAQ search results from the user-selected category.

Optionally, each of the display pages 44 and 82 includes a header 90 and 92, respectively, having a user-selectable option to view questions in a user-selected one of the categories. The questions may comprise the most frequently asked and/or most relevant questions from the FAQ database 36 that are in the user-selected category. The header 90 and 92 provides an alternative for browsing the questions and answers in the FAQ database 36 if, for example, the user is having difficulty with the search or wants to see what other related information is in the FAQ database 36.

Optionally, the display page 82 also includes a user-selectable drop-down menu (such as a drop-down box, for example) with the other categories so that users may view and expand the view of their search results in another category without having to return to the display page 44 and select another option (such as the command 70 or the command 72, for example).

Optionally, the display page 82 also includes the user interface 16 so that the user can refine his/her search or enter a different search query.

The herein-disclosed system can be embodied by a computer system comprising one or more computers. The acts performed by each of the herein-disclosed components can be directed by respective computer program code embodied in a computer-readable form on a computer-readable medium. Each of the herein-disclosed components may comprise a respective computer processor responsive to the computer program code to perform the acts.

FIG. 3 is a screen shot of a user interface 16′ that is an embodiment of the user interface 16 in FIG. 2. The user interface 16′ has a single entry field 14′ accompanied by an instruction to the user to “search or ask a question”. Based on the instruction, users clearly recognize that they may use the single entry field 14′ to type either one or more keywords or a full question, thus eliminating a need for users to choose which search tool to use. For purposes of illustration and example, consider a user entering a search query of “long distance” into the single entry field 14′. The keyword search engine 24 searches the contents of a telecommunication service provider's Web site to find Web pages related to “long distance”. The natural language search engine 34 searches the FAQ database 36 to find questions and/or answers related to “long distance”.

FIG. 4 is a screen shot of a display page 44′ that is an embodiment of the display page 44 in FIG. 2. The display page 44′ is a search results page that combines keyword search results (e.g. the Web pages related to “long distance” in the telecommunication service provider's Web site) and FAQ search results (e.g. questions and/or answers from the FAQ database 36 that are related to “long distance”). Presenting both sets of results on the first page increases the likelihood that the user will find what he/she is looking for without having to navigate to other pages of search results, and reduces a number of pages to be maintained by developers.

The display page 44′ is divided into rows and columns to distinguish between different categories of search results (e.g. products and services, billing and account, and help and support) and different types of search results (e.g. keyword and FAQ results), respectively. Hyperlinks 70′, 72′ and 74′ are embodiments of the commands 70, 72 and 74, respectively, to expand the search results in a respective category. For purposes of illustration and example, consider the user clicking on or otherwise selecting the hyperlink 70′ for more products and services results.

FIG. 5 is a screen shot of a display page 82′ that is an embodiment of the display page 82 in FIG. 2. The display page 82′ is displayed to the user in response to the selection of the hyperlink 70′. The display page 82′ lists an expanded number of keyword search results and an expanded number of FAQ search results in the products and services category for the search query of “long distance”.

With reference to FIGS. 4 and 5, the display pages 44′ and 82′ include headers 90′ and 92′, respectively, to view top questions for all categories, or for a particular category selected using a drop-down box.

Some of the herein-disclosed features may be applied to provide a search results user interface that does not combine keyword search results and FAQ search results. For example, FIG. 6 is a screen shot of a display page 44″ that is an embodiment of the display page 44 in FIG. 2 that includes keyword search results but not FAQ search results. For a search query of “DSL”, the display page 44″ provides DSL-related search results categorized into products and services results, billing and account results, help and support results and corporate information results. Links to the top three DSL-related results for each category are displayed, as well as links to get more results from any category. If a user wants to view more results for a section, he/she is taken to a subsequent results page with ten results per page. The subsequent results page enables the user to modify the category that he/she wishes to view in detail at any time using a drop-down menu at or near the top of the page.

FIG. 7 is a screen shot of an example of a detailed search results page 96 that includes a drop-down menu 98 for modifying the category. For purposes of illustration and example, the page 96 is generated for a user-entered search query of “call forwarding” and for a user selection of the help and customer support category. The drop-down menu 98 enables the user to jump from the page 96 directly to either a second detailed page dedicated to call-forwarding-related search results in the products and services category, a third detailed page dedicated to call-forwarding-related search results in the billing and account category, or a fourth detailed page dedicated to call-forwarding-related search results in the corporate information category. This feature distinguishes over existing search engines that require the user to go back to a main results page to select another category.

It will be apparent to those skilled in the art that the disclosed embodiments may be modified in numerous ways and may assume many embodiments other than the particular forms specifically set out and described herein.

The above disclosed subject matter is to be considered illustrative, and not restrictive, and the appended claims are intended to cover all such modifications, enhancements, and other embodiments which fall within the scope of the present invention. Thus, to the maximum extent allowed by law, the scope of the present invention is to be determined by the broadest permissible interpretation of the following claims and their equivalents, and shall not be restricted or limited by the foregoing detailed description. 

1. A computer-implemented method, comprising: receiving a search query from a single input field of a user interface; performing a keyword search based on the search query to generate keyword search results; performing a natural language search of a frequently-asked question (FAQ) database based on the search query to generate FAQ search results; outputting a first display page that combines the keyword search results and the FAQ search results, wherein the first display page categorizes the keyword search results and the FAQ search results into a plurality of categories, wherein each category of the plurality of categories includes a user-selectable command to output another display page that expands the combined search results; and in response to a selection of the user-selectable command associated with a particular category, outputting a second display page that expands the keyword search results and the FAQ search results associated with the particular category.
 2. The computer-implemented method of claim 1, wherein the plurality of categories are determined using a card sort method that includes creating a representative sample of web site content on a plurality of cue cards.
 3. The computer-implemented method of claim 2, wherein each cue card of the plurality of cue cards is assigned to a group by a first set of individuals and wherein each group is assigned a name by the first set of individuals.
 4. The computer-implemented method of claim 3, wherein each individual of the first set of individuals performs the group assignment and the naming assignment.
 5. The computer-implemented method of claim 4, wherein a statistical cluster analysis is performed to determine a set of groupings based on the group assignment and the naming assignment performed by each individual.
 6. The computer-implemented method of claim 3, wherein the group assignment performed by the first set of individuals is validated using a reverse card sort method.
 7. The computer-implemented method of claim 6, wherein a second sample of individuals that is different from the first set of individuals sort the cue cards into the groups assigned by the first set of individuals.
 8. The computer-implemented method of claim 1, wherein the keyword search results of the first display page are displayed in a first column and the FAQ search results of the first display page are displayed in a second column.
 9. The computer-implemented method of claim 8, wherein the keyword search results of the second display page are displayed in the first column and the FAQ search results of the second display page are displayed in the second column.
 10. The computer-implemented method of claim 8, wherein the combined search results associated with each category are displayed in different rows.
 11. The computer-implemented method of claim 10, wherein the combined search results that best match the search query are displayed in a first row and the combined search results with the next best match to the search query are displayed in a second row.
 12. A computer-readable storage medium comprising instructions that, when executed by a processor, cause the processor to: receive a search query from a single input field of a user interface; perform a keyword search based on the search query to generate keyword search results; perform a natural language search of a frequently-asked question (FAQ) database based on the search query to generate FAQ search results; output a first display page that combines the keyword search results and the FAQ search results, wherein the first display page categorizes the keyword search results and the FAQ search results into a plurality of categories, wherein each category of the plurality of categories includes a user-selectable command to output another display page that expands the combined search results; and in response to a selection of the user-selectable command associated with a particular category, output a second display page that expands the keyword search results and the FAQ search results associated with the particular category.
 13. The computer-readable storage medium of claim 12, wherein the plurality of categories are determined using a card sort method that includes creating a representative sample of web site content on a plurality of cue cards, wherein each cue card of the plurality of cue cards is assigned to a group by a first set of individuals and wherein each group is assigned a name by the first set of individuals.
 14. The computer-readable storage medium of claim 13, wherein the group assignment performed by the first set of individuals is validated using a reverse card sort method, wherein a second sample of individuals that is different from the first set of individuals sort the cue cards into the groups assigned by the first set of individuals.
 15. The computer-readable storage medium of claim 13, further comprising instructions that, when executed by the processor, cause the processor to: determine a first category that best matches the search query and a second category with the next best match to the search query; display the combined search results that best match the search query in a first row of combined search results; and display the combined search results with the next best match to the search query in a second row of combined search results.
 16. A system, comprising: a processor; computer program code executable by the processor to: receive a search query from a single input field of a user interface; perform a keyword search based on the search query to generate keyword search results; perform a natural language search of a frequently-asked question (FAQ) database based on the search query to generate FAQ search results; output a first display page that combines the keyword search results and the FAQ search results, wherein the first display page categorizes the keyword search results and the FAQ search results into a plurality of categories, wherein each category of the plurality of categories includes a user-selectable command to output another display page that expands the combined search results; and in response to a selection of the user-selectable command associated with a particular category, output a second display page that expands the keyword search results and the FAQ search results associated with the particular category.
 17. The system of claim 16, wherein the plurality of categories is determined using a card sort method that includes creating a representative sample of web site content on a plurality of cue cards, wherein each cue card of the plurality of cue cards is assigned to a group by a first set of individuals and wherein each group is assigned a name by the first set of individuals.
 18. The system of claim 17, wherein the group assignment performed by the first set of individuals is validated using a reverse card sort method, wherein a second sample of individuals that is different from the first set of individuals sort the cue cards into the groups assigned by the first set of individuals.
 19. The system of claim 16, wherein the keyword search results of the first display page are displayed in a first column and the FAQ search results of the first display page are displayed in a second column.
 20. The system of claim 19, wherein the combined search results associated with each category are displayed in different rows, wherein the combined search results that best match the search query are displayed in a first row and the combined search results with the next best match to the search query are displayed in a second row. 